CDS

Accession Number TCMCG002C05771
gbkey CDS
Protein Id XP_020084485.1
Location join(14983143..14983235,14983316..14983351,14983435..14983547,14984224..14984269,14984513..14984559,14984808..14984915,14986397..14986472,14987596..14987668,14987767..14987912,14988116..14988201,14988365..14988504,14988571..14988710,14988788..14988838,14989965..14990028,14990835..14990978,14991043..14991110,14991199..14991257,14991387..14991543,14993039..14993091,14993184..14993282,14993666..14993780,14993879..14994001,14994286..14994456)
Gene LOC109707553
GeneID 109707553
Organism Ananas comosus
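
The Location field above uses the GenBank-style join(...) notation, where each start..end pair is one coding segment given in 1-based, inclusive coordinates. The following is a minimal Python sketch, assuming only that notation, that recovers the segments and sums their lengths. The names parse_join and cds_length are illustrative and do not come from any database tool.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(14983143..14983235,14983316..14983351,14983435..14983547,"
    "14984224..14984269,14984513..14984559,14984808..14984915,"
    "14986397..14986472,14987596..14987668,14987767..14987912,"
    "14988116..14988201,14988365..14988504,14988571..14988710,"
    "14988788..14988838,14989965..14990028,14990835..14990978,"
    "14991043..14991110,14991199..14991257,14991387..14991543,"
    "14993039..14993091,14993184..14993282,14993666..14993780,"
    "14993879..14994001,14994286..14994456)"
)


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs (1-based, inclusive) found in a join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments: list[tuple[int, int]]) -> int:
    """Total number of nucleotides covered by the segments."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total = cds_length(segments)
print(len(segments), "segments,", total, "nt")  # 23 segments, 2208 nt
```

The total of 2208 nt equals 3 x (735 + 1), which is consistent with the 735 aa protein length listed in this record plus one stop codon.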

Protein

Length 735 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA371634
db_source XM_020228896.1
Definition DNA mismatch repair protein MSH4 isoform X1 [Ananas comosus]

EGGNOG-MAPPER Annotation

COG_category L
Description DNA mismatch repair protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03400
KEGG_ko ko:K08740
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGTAACAATTATTTCCCCAATCAAATTGGCACCGGATGGCATGATGGGAGTGTCTGAGCTGGTTGATAAGCATTATTCATTGAACAAAAAGATCATAATGTCTCGCGGCTGCTTTGATGATACAAAGGGTGGTCTCTTGGTCAAAAATTTGTCAGCGAAGGAGCCATCTACTCTTGGTTTAGATACTTATTGCAAGCAATATTATCTCTGCCTCGCTGCTGCTGCTGCTACCATCAAATGGACTGAAGCAGAGAAAGGCATAATAGTAACAAATCACTCACTGTCGGTTACGTTCAATGGTTCATTTGATCACATGAATATTGATGCTACTAGTGTTCAAAATTTAGAAATTATTGACCCTCTACATACCGAACTGTGGGGTTCTAGCAACAAGAAGAGAAGCCTATTCCGGATGTTGAAGACGACGAAGACTATAGGAGGGGCTAGACTACTTCGAGCCAACTTACTACAACCATTAAAGGATATGGAAACTATCAATGCCCGTCTTGATTGCTTAGATGAACTAATGAGCAATGAAGAGTTGTTCTTTGGACTGACGCAGGGTCTTCGAAAATTTCCAAAAGAGACAGACAAGGTTCTTTGTCACTTTTGTTTCAAACCCAGAAAGGTCACAGAGGAAGTTTTGAAGTCTGTAAATGGTAGAAAGAGTCAAATGCTGATTTCAGACATTATTGTTCTTAAAACAGCTCTGGATGCCATACCCTTTCTTTCCAAGGTTCTCAAGGATGCAAAATGTTCCCTTCTTTGCAACATTTACCGCACTGTTTGTGAAAATCAGAAATATTCAAACATGAGAAACAGAATTAGGGATGTAATTGATGAAGATGTGATACATGCAAGGGCTCCTTTTGTTGCTTGCACACAGCAGTGTTTTGCTATCAAAGCTGGAATAGATGGACTTCTTGATGTTGCACGCCGTTCCTTCTGTGACACAAGTGAAGCTATACATAATCTTGCAAACAAATACCGGGAGGAATTTAATCTGCCAAATTTGAAGATCCCATACAACAATAGGCAGGGATTTTACTTCAGTATTCCACAGAAGGACACATCTGGAAAGCTTCCTAATACATTTATTCAGGTCATGAAACATGGGAAAAACATACACTGCTCAAGTTTTGAACTTGCCTCTCTGAATGTTAGGAATAAATCAGCTTCTGCCGAATGCTTTTTGCGCACAGAACATTGTTTGGAAGGGCTGATTGATGCAATAAGGCAGGATATCTCCATACTAACATTGCTTGCAGAAGTCTTATGCCTTCTAGACATGATTGTGAATTCATTTGCGCACAGTATTTCAACTAAGCCTGTTGACCGCTACACAAGACCAGAGTTTACAGATAATGGACCAATGGCAATTGATGCTGGCAGACACCCTATACTGGAAAGCCTACACACTGATTTTGTTCCTAATAATCTTTTTCTCTCTGAAGCATCTAATATGGTGATTGTCATGGGCCCCAACATGAGCGGAAAAAGCACTTATCTTCAACAGATTTGTCTAATAGTCATCCTTGCACAAATCGGATGTTATGTCCCTGCTCGTTTTGCATCTCTAAGAGTGGTTGATCGCATATTCACACGGATTGGGACTGGAGATAATGTTGAATACAATTCTAGCACTTTTATGACGGAAATGAAAGAGACAGCTTTCCTCATGCAAAATGTGTCCCCAAAGAGCCTGGTTGTTATGGATGAGCTGGGAAGGGCAACTTCTTCCTCTGATGGGTTTGCAATTGCTTGGAGCTGTTGCGAGCATCTGCTAACTCTCAAAGCGTACACTATATTTGCTACGCATATGGAGGGTCTATCTGAACTTGCAACCATCTATCCTAATGTGAAGATTCTTCATTTTGAGGTCGACCTACGCAACAATCGCTTAGATTTCAAGTTTCGTCTCAAAGATGGTCCAAGGAGGGTGCCACACTACGGTCTTCTATTGGCTGGTGTTGCAGGTCTACCAAGCTCCGTGGTGGA
GACAGCAAGGAACATTACTTCAAGAATCACAGAAGAGGAAATGAGGAGAATGAATATCAACTTTGAGCAGTATCACTCAATTCAGATGGCGTACCGGGTTGCACAAAGGCTGATCTGCTTGAAATATTCCAACCAAGGCGAGGATTACATCCGCCAAGCGCTGCAGAATCTGAAGGAGAGCTACAATGAGGGTCGGTTTACATGCTGA
Protein:  
MVTIISPIKLAPDGMMGVSELVDKHYSLNKKIIMSRGCFDDTKGGLLVKNLSAKEPSTLGLDTYCKQYYLCLAAAAATIKWTEAEKGIIVTNHSLSVTFNGSFDHMNIDATSVQNLEIIDPLHTELWGSSNKKRSLFRMLKTTKTIGGARLLRANLLQPLKDMETINARLDCLDELMSNEELFFGLTQGLRKFPKETDKVLCHFCFKPRKVTEEVLKSVNGRKSQMLISDIIVLKTALDAIPFLSKVLKDAKCSLLCNIYRTVCENQKYSNMRNRIRDVIDEDVIHARAPFVACTQQCFAIKAGIDGLLDVARRSFCDTSEAIHNLANKYREEFNLPNLKIPYNNRQGFYFSIPQKDTSGKLPNTFIQVMKHGKNIHCSSFELASLNVRNKSASAECFLRTEHCLEGLIDAIRQDISILTLLAEVLCLLDMIVNSFAHSISTKPVDRYTRPEFTDNGPMAIDAGRHPILESLHTDFVPNNLFLSEASNMVIVMGPNMSGKSTYLQQICLIVILAQIGCYVPARFASLRVVDRIFTRIGTGDNVEYNSSTFMTEMKETAFLMQNVSPKSLVVMDELGRATSSSDGFAIAWSCCEHLLTLKAYTIFATHMEGLSELATIYPNVKILHFEVDLRNNRLDFKFRLKDGPRRVPHYGLLLAGVAGLPSSVVETARNITSRITEEEMRRMNINFEQYHSIQMAYRVAQRLICLKYSNQGEDYIRQALQNLKESYNEGRFTC
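
The CDS above is wrapped across more than one line, so whitespace needs to be removed before the sequence is used. As a rough sketch, the Python below runs a few basic consistency checks between a CDS and its protein: length divisible by three, an ATG start, a terminal stop codon, and a length that matches the protein plus one stop codon. The function name check_cds and the toy sequences are illustrative assumptions. The ATG check is a simplification, since some coding sequences use alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(cds: str, protein: str) -> list[str]:
    """Return a list of human-readable problems; an empty list means all checks passed."""
    # Drop line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split())

    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    if len(cds) != 3 * (len(protein) + 1):
        problems.append("CDS length does not match protein length plus stop codon")
    return problems


# Toy example (not the record's sequence): ATG GCT TGA encodes "MA" plus a stop.
print(check_cds("ATGGCT\nTGA", "MA"))  # []
```

To check this record, paste the full CDS and Protein strings in place of the toy arguments.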